Software Re-Use And Evolution In Text Generation Applications
نویسندگان
چکیده
1 Introduction A practical goal for natural language text generation research is to converge on a separation of functions into modules that can be independently re-used. This paper addresses issues related to software re-use and evolution in text generation systems. We describe the benefits we obtained by adapting and generalizing the generation modules and techniques we used for the successive development of three distinct text generation applications, PLANDoc, FLOW-Doc, and ZEDDoc. We suggest that design principles such as the use of a common , modular pipeline architecture, a consistent and general data representation for-*nat, and domain-independent algorithms for generation subtasks, together with component re-use and adaptation, facilitate both application development and research in the field. In our experience, these principles led to significant reductions in development time for successive applications, from three years to one year to six months, respectively. They also enabled us to isolate domain-specific knowledge and devise re-usable, domain-independent algorithms for generation tasks such as ontological generalization and discourse structuring. who also played essential roles in the design and development of PLANDoc and FLOWDOC. Recent technological advances, such as the widespread use of the World Wide Web and ready access to a multitude of extensive large-scale databases, have created novel opportunities for practical text generation applications. At the same time, to take full advantage of these opportunities, text generation systems must be easily adaptable to new domains , changing data formats, and distinct underlying ontologies. One crucial factor contributing to the generalization and subsequent practical and commercial viability of text generation systems is the adaptation and re-use of text generation modules and the development of re-usable tools and techniques. In this paper, we focus on the lessons learned during the successive development of three text generation systems at Bellcore: PLANDoc (McKeown et al., 1994) summarizes execution traces of an expert system for telephone network capacity expansion analysis; FLOwDoc (Passonneau et al., 1996) provides summaries of the most important events in flow diagrams constructed during business re-engineering; and ZEDDoc (Passonnean et al., 1997) produces summaries of activity for a user-specified set of advertisements within a user-specified time period from logs of WWW page hits. We built FLowDoc and ZEDDoc by adapting components of the PLANDoc system. The transfer of the original PLANDoc modules to new domains led to the replacement of some hard-coded rules and ontological knowledge with more general, domain-independent components. This encapsula-tion, or "plug-and-play" feature, enabled …
منابع مشابه
Multilingual Natural Language Generation for Multilingual Software: A Functional Linguistic Approach
In this paper we present an implemented account of multilingual linguistic resources for multilingual text generation that improves significantly on the degree of re-use of resources both across languages and across applications. We argue that this is a necessary step for multilingual generation in order to reduce the high cost of constructing linguistic resources and to make NLG relevant for a...
متن کاملTheoretical Explanation of the Use of Cyberspace and the Evolution of Family Structure in Iran with Emphasis on the Concept of Generation Gap
The family is the vital source of peace and comfort, love and intimacy. But the family can also be a place of conflict, difference, gap and distance in terms of values and patterns of behavior between children and parents. Virtual social networks are a new generation of social networking space that at the end of the first decade of the 21st century have changed the ways of communic...
متن کاملImprovement of generative adversarial networks for automatic text-to-image generation
This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...
متن کاملA Bi-Objective Approach to an Assembly Line Re-Balancing Problem: Model and Differential Evolution Algorithms
Assembly lines are special kinds of production systems which are of great importance in the industrial production of high quantity commodities. In many practical manufacturing systems, configuration of assembly lines is fixed and designing a new line may be incurred huge amount of costs and thereby it is not desirable for practitioners. When some changes related to market demand occur, it is wo...
متن کاملThe Effect of Iranian EFL Learners’ Self-generated vs. Group-generated Text-based Questions on their Reading Comprehension
Reading comprehension is one of the most important skills, especially in the EFL context. One way to improve reading comprehension is through strategy use. The present study aimed at investigating the effect of question-generation strategy on learners' reading comprehension. The participants in the study were 63 intermediate students from three intact groups in Resa institute in Boukan, They we...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997